Data Provenance Architecture Supporting Environmental Monitoring Processes

نویسندگان

  • Daniel Lins da Silva
  • André F. M. Batista
  • Pedro Luiz Pizzigatti Corrêa
چکیده

Long-term research and environmental monitoring are essential for the improved management of ecosystems and natural resources. However, to reuse this data for new experiments, decision-making processes, and integrate these data with other long-term initiatives, scientists need more information related to data creation and its evolution, intellectual property rights, and technical information in order to evaluate the use of this data. Provenance metadata emerges as a way to evaluate the quality and reliability of data, audit processes and the data versioning, while enabling the data reuse and the reproducibility of experiments and analysis. However, most solutions for the capture and management of provenance metadata are based on specific tools, restricted scopes, and they are difficult to apply in distributed and heterogeneous environments. In this paper, we present an approach for capturing, managing, and publishing the provenance metadata generated in the environmental monitoring processes. Our computational architecture comprises three main components: (1) a data model based in PROV-DM and Dublin Core; (2) a repository of RDF Graphs; and (3) a Web API that provides services for collecting, storing, and querying provenance metadata. We demonstrate the application of our approach and show its practical usefulness by evaluating this architecture to manage provenance metadata generated during an environmental monitoring simulation. The results show that our approach is effective in collecting and storing provenance metadata and allows the query of an entire provenance of datasets and data products, thus enabling reuse, discovery, and visualization of raw data, processes, and scientists involved in its generation and evolution.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Standardisation of Provenance Systems in Service Oriented Architectures

This White Paper presents provenance in computer systems as a mechanism by which business and e-science can undertake compliance validation and analysis of their past processes. We discuss an open approach that can bring benefits to application owners, IT providers, auditors and reviewers. In order to capitalise on such benefits, we make specific recommendations to move forward a standardisatio...

متن کامل

Recording Actor Provenance Data in Scientific Workflows

The concept of “actor” provenance data – essentially data that a client or service actor may assert about itself regarding an interaction, is presented. Actor provenance data can be combined with assertions of interaction to enable better reasoning within a provenance system. The need for recording and maintaining actor provenance data is discussed, along with the description of an architecture...

متن کامل

Modelling Provenance of Sensor Data for Food Safety Compliance Checking

The Internet of Things (IoT) is resulting in ever greater volumes of low level sensor data. However, such data is meaningless without higher level context that describes why such data is needed and what useful information can be derived from it. Provenance records should play a pivotal role in supporting a range of automated processes acting on the data streams emerging from an IoT-enabled infr...

متن کامل

Provenance in Systems for Situation Awareness in Environmental Monitoring

As environmental monitoring systems increasingly automate the collection and processing of environmental sensor network data, the technical components of such systems can automatically obtain and maintain higher levels of situation awareness—awareness of the monitored part of reality. In order to increase confidence in the correctness of situation awareness maintained by such systems it is impo...

متن کامل

Supporting Provenance in Service-oriented Computing Using the Semantic Web Technologies

(resources) are dynamically discovered and composed into workflows for problem solving, and later disbanded. This gives rise to an increasing demand for provenance, which enables users to trace how a particular result has been arrived at by identifying the resources, configurations and execution settings. In this paper we analyse the nature of service-oriented computing and define a new concept...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJWA

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2016